CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories
نویسندگان
چکیده
CRISP-DM(CRoss-Industry Standard Process for Data Mining) has its origins in the second half of nineties and is thus about two decades old. According to many surveys user polls it still de facto standard developing data mining knowledge discovery projects. However, undoubtedly field moved on considerably twenty years, with science now leading term being favoured over mining. In this paper we investigate whether, what contexts, CRISP-DM fit purpose We argue that if project goal-directed process-driven process model view largely holds. On other hand, when projects become more exploratory paths can take varied, a flexible called for. suggest outlines such trajectory-based might look like how be used categorise (goal-directed, or management). examine seven real-life exemplars where activities play an important role compare them against 51 use cases extracted from NIST Big Public Working Group. anticipate categorisation help planning terms time cost characteristics.
منابع مشابه
CRISP-DM: Towards a Standard Process Model for Data Mining
The CRISP-DM (CRoss Industry Standard Process for Data Mining) project proposed a comprehensive process model for carrying out data mining projects. The process model is independent of both the industry sector and the technology used. In this paper we argue in favor of a standard process model for data mining and report some experiences with the CRISP-DM process model in practice. We applied an...
متن کاملData Mining The Original Data Warehouse: Twenty-Five Years And A Million Lines of SAS Later
The author of MXG Software provides a historical perspective of how and why the SAS System became the pervasive tool for managing and mining of the original data warehouse, the Performance Data Base built from SMF data. The architecture of the MXG implementation is described to show how MXG, currently 911,749 lines of SAS code in 3,011 files (members), executes under MVS, VM, UNIX, OS/2, Window...
متن کاملSpecializing CRISP-DM for Evidence Mining
The use of all forms of computer and communication devices is changing human interaction and thinking. Electronic traces of actions and activities are continually being left behind most often unknowingly so. This situation creates opportunities for criminal investigators to make use of these traces and marks to uncover evidence. In this evidentiary discovery process several problems are experie...
متن کاملUsing Data Mining for Bank Direct Marketing: an Application of the Crisp-dm Methodology
The increasingly vast number of marketing campaigns over time has reduced its effect on the general public. Furthermore, economical pressures and competition has led marketing managers to invest on directed campaigns with a strict and rigorous selection of contacts. Such direct campaigns can be enhanced through the use of Business Intelligence (BI) and Data Mining (DM) techniques. This paper de...
متن کاملInstantiation and adaptation of CRISP-DM to Bioinformatics computational processes
Among the many contributions made by information technologies to Bioinformatics, the techniques of intelligent data analysis combined with optimization techniques are the main application field nowadays. Many researches focused on DNA microarray field have proposed different approaches trying to obtain new undiscovered knowledge of diseases such cancer. All these researches can be represented a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Knowledge and Data Engineering
سال: 2021
ISSN: ['1558-2191', '1041-4347', '2326-3865']
DOI: https://doi.org/10.1109/tkde.2019.2962680